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Abstract 

The uniform distribution is an important counterexample in game 
theory as many of the canonical game dynamics have been shown not 
to converge to the equilibrium in certain cases. In particular none of 
the canonical game dynamics converge to the uniform distribution in 
a form of rock-paper-scissors where the amount an agent can lose is 
more than the agent can win, despite this fact, it is the unique Nash 
equilibrium. I will show that certain incentive dynamics are asymp- 
totically stable at the uniform distribution when it is an incentive 
equilibrium. 
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Incentive dynamics ["^ryl2] are given by 

where (p{x) is the a vahd incentive for the game. It was shown that if the in- 
centive for a finite game is continuous, there exists a fixed point characterized 
by 

(fia{x) = Xia ^ ^ii3{x) Va, i. 

Notice that if this occurs at the uniform distribution, either (fia{x) are aU 
zero, or they are aU the same for each agent. 

Nash's original incentive function is fixed if and only if all the component 
incentives are zero and thus it can only be in the first case described above. 
In contrast, the incentive function given by ^f^ii^) = ^^(<^a7 — Ui{x))^ is 
only zero when Ui{x) > max^ 2/^(6^^, e^), where G S-i which can occur at 
the uniform distribution only if the game is constant, which is a degenerate 
case of little interest. Despite their diflFerences we will demonstrate that the 
two can agree under certain circumstances. Also, we will see that the latter 
incentive is globally asymptotically stable at a uniform Nash equilibrium 
where the canonical dynamics fail to converge. 



0.1 A Bad Game of Rock-Paper-Scissors 



The standard game of Rock-Paper-Scissors (RPS) is given as a two person 
zero sum game with payoffs given in the table below on the left. 
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To the right of the RPS payoffs we have a generalized RPS with a and b both 
positive. The case when 6 > a, or an agent can lose more than it can win, 
is an important example of a game. The unique Nash equilibrium for this 
game is the uniform distribution. We have seen many examples of incentive 
dynamics that have Nash equilibrium as their interior fixed points, such as 
the replicator equations, projection dynamics, the logit equations, best reply 
dynamics, and the Brown-von Neumann-Nash equations. However, in every 
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one of these cases the dynamics do not converge to the unique equihbrium 
as shown in the figures below^ This leads us to the natural question: does 
any incentive dynamic converge to a rest point from any initial point? 

1 Agreement Among Incentives 

We note the incentive ^f^ii^) = X]^(<^a7 — Ui{x))^ can be rewritten in the 
form (ffi{x) = Y.^{ui{ea,x_i) - Ui{x) + a^^ - t^i(ec., which shows 

that it is similar to a Nash comparison in that we are checking the payoflF 
given the other agents' strategies are fixed. However, we are tempering that 
comparison by taking away the amount by which the agent is not receiving 
a preferred payoflF available in the game. We will now show there is a class 
of games, which includes general RPS, with the property that the uniform 
distribution is a Nash equilibrium as well as an incentive equilibrium for 
(p^{x). First we will need the following lemma. 

Lemma 1. If Ai is the payoff matrix for the ith agent ^ then x is a Nash equi- 
librium where each agent is using the uniform distribution over its strategies 
if and only if for each i, Ai has an equal sum across rows. 

Proof. We begin by noting that for an interior Nash equilibrium we must 
have 

Ui{ei,x_i) = . . . = Ui{es^,x_i), Mi. 

It should suffice then to calculate the value of just one of the Ui{ea^ x^i). We 

-^The images were produced using the Dynamo Mathematica package developed by 
Sandholm, Dokumaci, and Franchetti [SDFll]. Colors indicate speed: blue is slowest and 
red is fastest 



3 



will use the n-linearity of the payoffs to complete the task. 

ui{ei,x-i) = —Ui{ei,ej^,Xs,...,Xn) (1) 

— 1 

1 

= —^ui{ei,ej^,xs,...,xn) (2) 



S2 ^ . 



^ ^ ^ ^ '^1 (^1 5 5 5 • • • 5 ^n) (3) 



^2 = 1 J3 = i- 
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X] • • • Meuej^^ej^, . . . , e^J (4) 
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which is exactly the average of the coefficients in the first row of Ai. Thus 
for any agent i we have the equalities 

which after cancellation of the non-zero term proves our assertion. □ 

Proposition 2. // uniform distribution^ x ^ is a Nash equilibrium and 
in each of the payoff matrices the sums of the elements in each row that 
are larger than the average are equal, then it is an incentive equilibrium for 

Proof We will use the above lemma to prove the assertion. Given that the 
rows must all have an equal sum, the average of the elements in A^, which 
we will denote a^, is equal to ^ an^. Let us now consider the condition 
for an incentive equilibrium when our incentive is given by (f^{x). At a Nash 
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equilibrium we have the following calculation for each agent i 
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= ^{a^^ - Ui{eia, x_i))+ (7) 



(8) 



+ 



= ^{aa^ - a^)+ (9) 



where the second line is justified since x is a Nash equilibrium and thus 
Uiieia^X-i) — Ui{x). The last line is simply the sum of all the elements 
from row a that are larger than the average. Given our assumption, it 
must be the case that (pf^{x) — ^^(x) for every a and /?. Thus we have 
^Fai^) = i" every agent i, which is true if and only if x is an 

incentive equilibrium. □ 

To summarize, we found a class of games where the Nash equilibrium co- 
incides with the incentive equilibrium for (f^{x) at the uniform distribution. 
All RPS games have the property that the rows of the payoff matrices are 
permutations of the first row. Games with this property form a subset of the 
games where the Nash equilibrium and our incentive equilibrium agree. 

We conjecture that this is the only agreement outside of constant games 
and strategies where players are receiving their respective maximum pay- 
off. There are simple counterexamples when either of the conditions is 

/I 0\ 

dropped. For example, if ^4^ = 10 1, the average is 0, but the 

VO -3 1/ 

sums across rows are not equal. The interior Nash equilibrium is x = 
((1/6, 1/6,2/3), (1/6, 1/6,2/3)) while the incentive equilibrium for ip^ is the 

/ 1 2 \ 

uniform distribution. On the other hand, if Ai — I ^ ^ 1 then the Nash 

equilibrium is the uniform distribution, but the incentive equilibrium is x 
((0.31, 0.69), (0.31, 0.69)). 
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2 Asymptotic Stability 



As we have seen, many of the dynamics that have Nash equihbria as fixed 
points do not necessarily converge to the uniform distribution. The specific 
examples that do (at least so far) have been Rock-Paper-Scissors type games. 
We notice that the main idea is to create a cycle of best replies by permuting 
the values in the first row of the payoff matrix. This cyclic behavior is 
essentially the problem with convergence. We will now show that changing 
the parameters while maintaining this type of cyclic payoff structure has no 
impact on the asymptotic stability of the incentive equilibrium for (p^(x). 

Proposition 3. // the rows of the payoff matrix Ai are permutations of each 
other, (pf^{x) — (pf^p{x) for all x ^ A and either (pf^{x) — or Xi^ — j: for 
every a at incentive equilibrium. 

Proof Denote a as the permutation that takes row a to row /5; then every 
element in row /3 can be written as a^^^ = ai^aik) for some k ^ Si. Thus 

= ^7(^^/37 - = Y^ki^iaaik) - Ui{x))^ = (p^^{x) regardless of x. 

We can now use this fact to describe all possible incentive equilibria for 
(p^{x). By definition, at equilibrium f, (p^^^^x) = for every 

i and every a. Given that the incentive functions are all equal regardless 
of X G A, we must have (pf^{x) — XiaSi(pf^{x)^ which is true if and only if 
(ff^{x) — for all Of, which can occur only at the boundary or in a degenerate 
game, or when Xia — — for every a. □ 

Recall the definition of an ISS is 

d^i • ^ j^i 

Xi Xi 

for dl\ X ^ X in some neighborhood of x. Also, an ISS is asymptotically 
stable wherever it satisfies the inequality in the definition. It will suffice then 
to prove that the uniform distribution is an ISS and the entire space is its 
basin of attraction. 

Theorem 4. If x is a uniform incentive equilibrium for (p^{x) and the payoff 
matrices have rows that are permutations of each other, then x is globally 
asymptotically stable in intA for the incentive dynamics. 

Proof. The previous proposition gives us that the incentives are equal for all 
a so we can without loss of generality use only (p^{x) for each i. 
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If we define /(x) = ^ it is easy to show that f{x) has a global 
minimum of s'f when Xi is the uniform distribution. We simply optimize 
using Lagrange multipliers, noting that the Hessian matrix of f{x) is positive 
definite in the interior of A. Thus x satisfies the ISS definition for all x G 
intA. □ 

We further conjecture that all interior incentive equilibrium are asymp- 
totically stable. If this is true, we can reduce the open problem of finding 
a game dynamic where every orbit converges to a rest point to proving that 
the basins of attraction for the incentive equilibrium form a partition of A. 
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(a) BNN 



(b) Logit (0.2) 



A 




(c) Smith 

Figure 1: Stable limit cycles in Bad RPS 
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(b) bad RPS 



(a) standard RPS 



Figure 2: The replicator dynamics display invariant limit cycles and unstable 
equilibrium points in the RPS 




(a) bad RPS 



(b) RPS 




(c) good RPS 

Figure 4: Global asymptotic stability of the uniform distribution in the si- 
multaneous updating dynamics 
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